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I. INTRODUCTION 


This thesis models foreign policy decisions using a 2 x 2 decision uUndemeam 
matrix with interval scaled pavoifs. The general application concerns two nations with 
a conflict of interest situation existing between them. Conflict of interest situations will 
continue to arise as a result of the ideological differences that exist in the world today. 
These are part of the overall East vs. West protracted conflict as characterized bv 
Strause-Hupe, Kintner, Dougherty and Cottrell in their book Protracted Conflict. “The 
United States in this atmosphere will be repeatedly confronted with the enigma of 
making a decision in the face of uncertainty.” [Ref. |] Foreign policy decisions involve 
deciding what course of action is the most appropriate, given the evidence at hand. The 
courses of action available could be considered either peaceful or aggressive actions. 
The evidence at hand could be the behaviors the antagonist 1s presenting exhibiting. Are 
lis actions peaceful in the present situation or are they aggressive? 

A conflict of interest situation involving the United States and the Soviet Union 
could be considered as a simple 2 x 2 decision under risk matrix, where the United 
States, considered as the decision maker, would choose one of two alternatives 
(peaceful or aggressive action). The Soviet Union could be considered to have these 
Same two courses of action open to it. For the purposes of this thesis, all actions are 


considered to be one of these two alternatives. 


A. CATEGORIZING PEACEFUL OR AGGRESSIVE ACTIONS 
Grouping the alternative courses of action into the appropriate category of 
peaceful or aggressive should be a fairly easy task. For example, during thesSenm 
crisis. troop and combat aircraft buildups in the vicinity of Berlin by the Soviet Union 
might be interpreted by the Unmited States as the U.S.S.R. pursuing a oma 
(aggressive) course of action. On the other hand, the U.S.S.R.’s permitting American 
aircraft the use of prescribed air corridors while in-flight to Berlin might be interpreted 
to mean that the U.S.S.R. is pursuing a cold-svar (peaceful) course of action. 
Historical examples of exercising a hot-war course of action by the United Staies 
might include: 
e the landing and liberation of Grenada by U.S. Armed Forces, 
e the sending of Marines into Lebanon. 


¢ and the Libvan bombings. 


Examples of Cold-war alternatives might include: 
e the banning of trade with China after the Communist takeover, 
e the SALT Talks, 


e and any passive act including the withdrawal of military forces or military aid. 


B. A DECISION UNDER RISK 

Two elements tn the structure of a decision are the set of alternatives available to 
the decision maker and a set of factors (over which he has no control) influencing the 
@ecision Outcome. [hese latter elements are called futures or future states of nature. A 
decision under cerrainty is one in which we assume that one and only one future state 
will occur. A decision under uncertainty 1s one in which we are unable to estimate the 
probabilities of these events. A decision under risk is one in which we can estimate the 
probabilities of these future states and more than one of them 1s greater than Zero. 
beef 2) The relative pavoffs for different outcomes in this 2 x 2 decision under risk 
fiers) are Not the same for the United States and the Soviet Union. Therefore, this 
situation cannot be viewed as a two-person zero-sum game. However, the conflict of 
interest situation could be viewed as a statistical decision problem if the frequency of 
employment of peaceful (or aggressive) actions by the Soviet Union was regarded as a 
random state of nature. If the proportion of actions that are peaceful by the U.S.S.R. 
could be described probabilistically and the advisors in the State Department and the 
Joint Chiefs of Staff could estimate that distribution, then a good or reasonable 
estimate could provide the President with a statistical foundation for making a 
Mecsion. Ihe decision maker, in order to reduce the risks of selecting alternatives, 
wouid hke to know what proportion of actions by the Soviet Union are peaceful. 

iemmecisiOn chosen 167 a course of action by the United States could be 
determined using a decision rule that depended in part the probability distribution that 
was used to describe the Soviet Union’s (or any antagonist’s) past behavior. This 
probaoility distribution might initially be a subjective one, with which the United 
States (or any protagonist) conjectured as to best describing the antagonist’s behavior. 


This probability distribution is Known as an @ priori probability distribution. 


C. A PRIORI TO A POSTERIORI PROBABILITY DISTRIBUTION 
After the establishment of the a priori probability distribution, the advisors could 
observe the conduct of the antagonist and interpret specific events as associated with 


one of the two possible courses of action. These observations may then be used to 


update the previously conjectured probability distribution. Bayes Theorem provides a 
method for updating the a priori probability distribution. In general terms, Baves 
Theorem uses an a priori probabilitv function and derives an a posteriori probability 
function given the sample values of a random variable. After the sample values are 
xnown, the @ posteriori probability function summarizes the a priori and sample 
information about the proportion of actions, by the antagonist, that are peaceful. 
Bayes Theorem, thus, improves the assessment of the uncertainty as toi 
conjectured course of action the antagonist is pursuing. The changed probabilities are 
known as a posteriori probabilities. These become the present estimates, by the 
protagonist, of the probabilities that the antagonist is using one or the other specific 
course of action. These a posteriori probabilities then become the a priori probabilities 


in further estimates of the antagonist’s behavior. 


D. DECISION RULES FOR THE PROTAGONIST 

Once an a posteriori probability distribution has been established, statistical 
decision theorv may then be used to choose objectively an appropriate alternative 
through the use of a decision rule that 1s based on the matrix payoffs and greatest 
expected value of return. This decision rule will assist the decision maker as to which 
Strategy tO pursue given any past history of the antagonist’s behavior. 

The particular ordering of outcomes in terms of preferences selected and the 
values that the protagonist assigns to these outcomes 1s an important step in the use of 
this model. As will be seen later in the general development of the decision rule, this 
process can be the sole deternuning factor by which the model proposed here will 
indicate an aggressive Or a peaceful action be taken by the protagonist. 

The model in this thesis asks a decision maker to choose an ordering, from the 
most desirable to the least desirable, of four possible outcomes which show how each 
course of action for the protagonist (the matrix rows) combines with a course of action 
for the antagonist (the matrix columns) into four possible outcomes. Then it asks him 
to quantify, on an interval scale, the desirabilitv of two of these outcomes relative to 
two standards. Next, a judgement is required stating which of many conjectured a 
priori probability distributions he believes is the best description of the antagonist s 
behavior. Based on the a priori distribution and observation of actual antagonist 
behavior an updated a posteriori probability distribution is established. Given the a 
posteriori probability distribution. which will not be the same for different occurrences 
of antagonist behavior, a decision rule is derived based on greatest expected value of 


gel nmele 
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Eo bests OUTLINE 

Theory and general model development will occur over the next five chapters. 
Chapter II will discuss the specific categorizations of actions. It will show how these 
actions are combined into outcomes (or states) in what will be called a conflict 
situation matrix. Chapter II will also include an explanation of the ranking and 
quantification of these outcomes. Chapter III will discuss the optimal solutions to the 
decision problem represented by the conflict situation matrix. Chapter [V will show 
how to estimate the antagonist’s behavior using a conjectured a priori probability 
distribution and Baves Theorem. Chapters V and VI will show the difference between 
using a discrete or a continuous a priori probability distribution and the decision rules 
that result. Chapter VII contains specific examples. Chapter VIII contains concluding 
remarks. We turn now to a discussion of the general two alternative, two states of 


nature conflict situation matrix that is the basis for the model in this thesis. 


Lt 


Il. CONFLICT SITUATION MATRIX 


This chapter begins with a definition of the two alternative courses of action 
available to the protagonist and the antagonist. We will see how these strategies result 
in four possible outcomes or states, in a conflict situation matrix. to which the decision 
maker assigns uulitv values. This chapter will conclude with a discussion of how these 


assigned utility values relate to each other on an interval scale. 


A. ALTERNATIVE COURSES OF ACTION 

Imagine a situation involving a conflict of interests between two nations 
designated as the protagonist (P) and the antagonist (A). The protagonist will be the 
nation making the decision in the face of uncertainty. The protagonist has a set of two 
possible courses of action that it can pursue. The set of possible courses of action for 
the antagonist also consists of two different actions. These courses of action are 
denoted as P, or P,, for the protagonist, and A, or A, for the antagonist. Let the 


courses of action for the nations involved have the following meaning: 


P.- cold war, non-aggressive or peaceful action, 


P, - hot war, aggressive or violent action, 


A. - cold war, non-aggressive or peaceful action, 


Ay - hot war, aggressive or violent action. 


The four possible situations which can then occur as the nations involved exercise their 


options may be denoted as: 
dsae\ reser ee) and (P. Ay). 


The pair (P.,A,) describes the condition of both nations pursuing peaceful actions 
towards each other. The pair (P.,.A;,) denotes the protagonist displaying either peaceful 
Or non-vioient actions towards the aggressor while the antagonist is displaying 
aggressive or hor-war like actions. The pair (P),A;,) is the reverse of the previous 


situation and (Pp,A}) is both nations acting fol-war like or displaving aggressive 


acuons towards each other. These two alternative courses of action for both the 
protagonist and the antagonist could be represented as a 2 x 2 matnx. The two 
columns are the two possible behaviors the antagonist can exhibit, and the two rows 
would be the two alternative courses of action available to the protagonist. The four 
cells of this matrix are the four outcomes that can occur. A generalized representation 


of this conflict situation matrix is shown in Figure 2.1. 


Figure 2.1 Conflict Situation Matrix. 


B. RANKING OF OUTCOMES 

A decision maker must select an alternative act, course of action or strategy from 
a recognized set of decision alternatives. The connection between decisions and 
rankings of outcomes (or preferences) in anv decision scenario 1s made by assuming 
that preferences are the driving force behind decision choices. That is, it is assumed a 
decision maker would rather implement a course of action that led to a more preferred 


outcome over one that led to a less preferred outcome. [Ref. 3] 
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Suppose for a moment that the preferences chosen, by the protagonist, for the 


four possible outcomes are represented as 
lesa) > Cae.) > Sea, PAu ey sh 


with the meaning that the situation to the left of the symbol ” > ” is preferred. by the 
protagonist, to the situation to the right of the svmbol. and the notation, V(_), is the 
value of that outcome. These and all orderings of outcomes, bv the protagonist, shall 
be subject to the general requirements of utility theory in that they are transitive. That 
is, if A > Band B > C then A > C or if the protagonist is indifferent between A and 
B and is indifferent between B and C then he is indifferent between A and) Guage 
establishes the notion of coherence in decision theory or transitivity in the laws of 
mathematics. [Ref. 4: p. 74] 

Notice that rankings, other than the one suggested above, could occur) iiieae 
may be situations, in the eves of the decision maker, where being at war while his 
antagonist is not, (P,,A,), is preferred to both being at peace, (P,,A,), so that 
V(Py.A,) is greater than V(P.,A,). Another possibility may arise in the case of a 
political decision maker looking for justification of aggressive maneuvers against a less 
capable antagonist. He mught prefer the antagomist to take aggressive actions towards 
him first. (P..Ap), to all other outcomes, thus justifying a preference of (P;,Ay,), to all 


other outcomes later on. 


C. UTILITY VALUES FOR THE OUTCOMES 

Once an ordering of possible outcomes or preferences has been selectedma 
quantification of those outcomes needs to be determined by the protagonist in order to 
work within the framework of this model. As described in Fundamentals of Decision 
Analysis by Irving H. LaValle, (Ref. 4: p. 72] utility values assigned reflect the relative 
desirabilities of the possible outcomes. A generalized utility value assignment of the 


outcomes in this model might be indicated as follows: 


V(P.A,) = + 1.0 


V(Py Ap) = - 1.0 
MAE ee) =a 


eee! = hb, 
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Note that this utility scale uses two specific outcomes as references, whereas the usual 
procedure assigns the value of zero to the least preferred outcome and a value of one 
omerdem most prefered OUuccome. The two representations are merely linear 
transformations of each other and are equivalent in information content. [Ref. 5] The 
meaning of the value of + 1.0 attached to V(P..A,) is a standard of preference for a 
desirable situation -- both nations at peace. The value of - 1.0 attached to Veen 1S 
a standard of preference for an undesirable situation -- both nations at war. In the 
example considered in the previous section a has a value less than - 1.0 and b has a 
value less than + 1.0 and greater than - 1.0. More succinctly, for this example of 


forererences: 


ition \) = Vib ea) = V(Py Ay) = NACE eee 
OF 
ete eee er eee (ee a 


The assignment of reference values to the outcomes (P.,A,) and (P},,A;,) of + 1.0 and 
- 1.0 forces the V(P..A,) to always be greater than V(P,.A,) in this model. This 
modeling restriction allows 12 possible orderings of outcomes. Which ordering of 
outcomes the protagonist selects will ultimately affect the alternative course of action 
he should pursue to gain his greatest expected value of return. The utility values a and 
b can be any arbitrary real numbers for the situations (P..A;,) and (P,,A,). Each 
numerical value is the Uvriliry to the protagonist of each possible situation. A 
generalized matrix representation of the conflict situation with associated utility values 


assigned bv the protagonist would appear as previously illustrated in Figure 2.1. 


D. INTERVAL SCALING 

The utility values of the conflict situation matrix are on an interval scale. Interval 
scales have no natural origin. There is a zero point on this scale but it is a defined 
point, while a natural origin would indicate the absence of the property being 
measured. [Ref. 6] If V; is the utility value of ith outcome, then transformations of the 
type Y = m(V;) + ¢ (linear with a positive slope) are allowed. Due to the lack of a 
natural origin on the interval scale, one value of V, cannot be said to be a proportion 
of another value on that scale. When using interval scales, one mav onlv speak of 


relative differences. For example. the decision maker would, after quantification, want 


to be able to say that he believes that one outcome, relative to the worst outcome is p 
times as good as another outcome relative to the worse. In other words, if V; is the 
value for outcomes i = 1,2,3 or dand V, > V, > V, > V4. It could then be stated 


that 


This would imply that the difference between outcomes V, and V, is p times as good 
as the difference between outcomes V, and V,. This is the basic information contained 
in an interval scale. 

In this chapter we have introduced the conflict situation matrix (Figunemzae 
showing the possible courses of actions available, their resulting outcomes and the 
concept of utility values associated with these outcomes. The utility values assigned are 
rated on an interval scale relative to two standards of preference. In Chapter III we 
Wil discuss the optimal solutions to the conflict situation matrix using dominance and 
greatest expected value of return when the decision is made with Knowledge about the 


proportion of antagonist actions that are peaceful. 


Il. OPTIMAL SOLUTIONS WITH CERTAINTY ABOUT THE 
PROPORTION OF ANTAGONIST ACTIONS THAT ARE PEACEFUL 


The uncertainty in anv decision making process may usually be expressed by 
saying that we do not know what event will occur on some future occasion, where 


events are defined as subsets of a sample space. [Ref. 7] 





Figure 3.1 Conflict Situation Matrix Under Risk. 


In this model, the sample space is the set of values allowed, for the proportion of 
antagonist actions that are peaceful. An event about which we are informed, as to 


Whether it will occur or not, 1s called a certain event. Otherwise, it is referred to as an 


uncertain event. [Ref. 8: p. 8] 


In this chapter we look first at the case where the decision maker is certain about 
the proportion of actions by the antagonist that are peaceful. In particular we define 8 
as the proportion of actions by the antagonist that are peaceful. Since the decision 
maker is certain as to the value of 8, this implies Pr(O = 6) = 1.0, where 6 is a 
realization of the random variable ©. In the notation used here. A, denotes a peaceful 
(cold-war) act and A, (its complement) denotes an aggressive (hot-war) act. The 
proportion of actions that are not peaceful is (1 - 9), since Ay 1s the only other act that 


can occur. The conflict situation matrix under risk is shown in Figure 3.1. 


A. DOMINANCE 

From the decision matrix it 1s clear that if a is greater than or equal to - 1.0, and 
b is less than or equal to + 1.0, then the decision maker will find P, optimal by 
dominance regardless of the value of 8. Similarly, when b is greater than or equal to 
+ 1.0, and a is less than or equal to - 1.0, Py, will be optimal, regardless of the value of 
§. In each case he will be indifferent when the equalities hold, again, regardless of the 
value of @. Regions when the values of a and b lead to dominance are seen in the 
shaded regions in Figure 3.2. 

Considering that the conflict situation matrix in this model sets the utility values 
of outcomes (P},Ap,) and (P.,A,) equal to - 1.0 and + 1.0 respectively, we find that 


there are 12 possible ordering of preferences. Five of these engenmess ine 


1) V(P.,A-) 2 V(Py-A-) 2 V(PoAp) 2 V(PL.An) 


Or 
+].0 2 b 2 a 2 Gao 
2) VIP Ane] V(PQA,) = Ph) ee) 
Or 
a 2 +10 >) - |e 4; 
3) V(P..A.) 2 VP. AL) 2B AD) = GP peers) 
Or 
+ 1.0 24 0 2 ape eee 
4) V(P..A,) 2 V(PoAp,) 2 V(Py Ap) 2 V(PH Ag) 
Or 
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Figure 3.2 Graph of Decision Rule Equation with Dominance Regions. 








5) W(P..Ap) 2 VW(P..A,) 2 V(P}-A,) 2 V(P}-Ap) 


Or 


IV 


dn? “ae b =, <= 120: 


These five preference patterns all have 


besa 10> ands aye. -1.0. 


They all will find alternative P, optimal by dominance. 
Only one of the remaining seven preference selections also exhibits the 


characteristic of dominance. This particular ordering of outcomes is dominated by an 
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aggressive response, alternative P,, for the protagonist. This preference pattern 1s 


6) V(PpA,) 2 ViPoAg) > V(PpAn) 2 V(PoAp) 
OF 


b 2 22 
and always has 
b 2 0) and wanes aed), 


This preference choice is where the decision maker would prefer 1) acting hot-war like 
While his antagonist was not, to 2) peaceful conditions which would be preferred, to 3) 
total war which would be preferred, to 4) being peaceful while the antagonist acted 
aggressively. In this case the decision maker would always find P, optimal by 
dominance. 

How 1s the protagonist to decide, when there is no dominance, what course of 
action is most advantageous to himself, given that the value of @ is a certainty? One 
basis of choice he might use 1s to select the alternative that gives him the greatest 


expected value of return. 


B. EXPECTED VALUE AS A BASIS FOR CHOICE 
The expected value from the conflict situation matrix for alternatives P., Is 
OF aie) 
and for alternative Py, 1s 


bO - (1 - 8). 


The decision maker would need only select the alternative with the highest expected 
value, that is, whichever of the two quantities above ts the maximum. Since the values 
from the conflict situation are utility values, the protagonist would be selecting the 
alternative with the highest expected malin 

A decision rule, for the options presented in this matrix, can be defined as the 


assignment of a particular course of action for any value of 8, remembering that @ is 


0 


the proportion of actions by the antagonist that are peaceful. We can create such a 
decision rule by setting the two possible decisions’ expected value quantities equal to 
each other. We find, after simplifving and rearranging, that we have the following 


linear relationship between a and b, 
a = b(@ 1 - 6)-(1; I - 8), (eqn 3.1) 


where (@: 1 - @) is the slope of the line and (1, 1 - 8) is its intercept. This relationship is 
a family of lines that passes through the point a = - 1.0 and b = + 1.0 for all values 


of 9, 0 < 8 < + 1.0. Equation 3.1 is plotted in Figure 3.2 for three values of 0. 


C. MAXIMIZING EXPECTED UTILITY IN THE NON-DOMINATED CASES 
We now turn to a discussion of the remaining six possible orderings that do not 


exhibit the characteristic of dominance. This is where the decision maker always rates 
a> - |.Osanemb > + 1.0 

or 
ieee Wand oO <9 + 1.0: 


Two examples of these preference patterns are 


7) V(PLA,) > W(PoA,) > V(PoAn) > V(Pp Ap) 


Or 
b ee ee) ee a > «Ff, 
and 
So) Vea) Pile )ee (epg) = VCP CAD ) 
or 


The decision maker will find in the non-dominated cases that expected utility will 
be maximized by choosing P.. if 


a> b(0/1-6)-(1) 1-89), (eqn si 
or by choosing Py, if 
a< bD(OY t=O) [Gre on (eqn 3.3) 


He will be indifferent when the equality holds (Equation 3.1). Equations 3.2 and 3.3 
gives the protagonist a well defined, unambiguous, decision rule that determines what 
action he should take, given a known value of @. In Chapter V we will look at how this 
type of decision rule can be used when the value of @ must be estimated. However, 
first we look at how to estimate the value of @ using conjectured a priori probability 
distributions and a likelihood function in Bayes Theorem to provide reasonable 


estimates of 9. 


t 
t 


IV. ESTIMATING THE UNCERTAINTY ABOUT THE PROPORTION OF 
ANTAGONIST ACTIONS THAT ARE PEACEFUL. 


Ultimately. the protagonist must choose an action based on his estimate as to 
which course of action the antagonist 1s pursuing. In the last chapter we discussed the 
case where the protagonist’s choice would be that course of action which gives him the 
greatest expected value of return. We also described the case where the proportion of 
actions by the antagonist that were peaceful was known, or that the value of @ was a 
certainty. What if protagonist has no information or at best limited information on the 
course of action the antagonist 1s pursuing? In this chapter we will see how to estimate 


the antagonist’s future behavior using Baves Theorem. 


A. ANTAGONIST ACTIONS AS BERNOULLI TRIALS 

One way for the protagonist to gather information in order to estimate 9 might 
be to observe the antagonist’s behavior and from these observations estimate what 
course of action he is pursuing. The two types of actions, (A, or Aj), wil be viewed 
as two mutuallv exciusive and exhaustive events, in the sense that every action (or 
behavior) observed can be categorized as peaceful or aggressive. [Ref. 9} A random 
variable 2a = 1 could then be defined, where: 


| O if an aggressive behavior is observed 

| 1.0 1f a peaceful behavior is observed 
and a is the n“ behavior observed. These observations of the behaviors can be viewed 
as the occurrence of a Bernoulli random variable with parameter 8. We would then 
have: 


ak, =) = ON eye (eqn 4.1) 


Our problem considers the case where there is also uncertainty as to the value of 
the parameter 9. The protagonist doesn’t know what course of action or mixture of 


courses of action will occur. He may, however, be willing to make some subjective 


guesses or conjectures as to what values of @ best describe the proportion of actions, 
by the antagonist, that are peaceful. This would make our Bernoulli trial, the 
Pr(X,, = 1), conditional on that proportion, 0. Therefore, we define the probability 
that the antagonist displays either peaceful or aggressive behavior, given that the 


proportion of actions that are peaceful (cold-war like) is 8, as: 


iosey es 3) = aye (eqn 4.2) 


B. A PRIORI PROBABILITY DISTRIBUTIONS ESTIMATE UNCERTAINTY 
The initial estimates about the proportion of antagonist’s actions that are 
peaceful, 8, maybe subjective on the part of the protagonist or may come from historic 


data. In the discrete case, we can generalize these initial estimates as: 
ie (08) a 15) 


For example, the protagonist might conjecture that the antagonist acts peacefully only 
a small proportion of the time and might represent his uncertainy about the value of 8 


AS 


Pr( © = .99) = .05 


and 


Free 


10) = .95. 
The probability distribution function of © has the properties, that the 
Pr(O = 6.) 2 0 and ), Pr(O = 8) = 1.0 
hOl fT tl eee Mine 0. are the possible values for the parameter. Both the discrete 


and the continuous cases will be considered in this thesis. In the continuous case, the a 


priori probability distribution will be denoted as f (8). 


For the discrete case only certain values for 0. in the interval (0.1) are allowed. 
The interval (0,1) is intentionally displayed as an open interval because attaching 
probabilities of 0 or 1.0 to uncertain events can lead to intractable conclusions 
suggested by Cromwell’s rule. This will be discussed in detail in the next section on 
Bayes Theorem. 

These probability functions represent the protagonist’s a priori estimates of 0. 
Before or without the introduction of more information, these a priori estimates of the 
probability distributions are the only estimates the protagonist has available in this 
model. However, after the protagonist observes the antagonist’s behavior, defined 
above as X,=1 where 1 = O or 1.0, a resulting @ posteriori probability distribution 


conditioned on the value of @ is given by Baves Theorem. 


C. BAYES THEOREM 
Ultimately we want to show that the conditional probability distribution being 


considered, the probability that the proportion of actions that are peaceful, or 


Pr( © = 86 ), given his most recently observed behavior is Sa = i, where 
ee OC eu ence of an aggressive behavior (hot-war) 
XN, = 1 - occurrence of a peaceful behavior (cold-war), 


1S given in this thesis as: 


Ooty Ne |.) 
Pr(Q=6.|X, =i) = ———_———.,_ r= ,2,3.... (eqn 4.3) 
2. PO = 8.) P(X, = i] 8.) 


[Ref. 10] We begin with the definition of conditional probability, which is given as: 


r n 
Oe DO ) 
BOs ail) 
In order to obtain the unconditional distribution of X, = 1, (the denominator) we 


invoke a widely used theory in the study of stochastic processes, that is, the Theorem 


of Total Probability, [Ref. 11] which states, that if PrN = ae 0.) for r = 1|,2,370mmS 
defined, then we have: 


Pr(Xn = i) = ). PHO 0) Peer sercee 


The conditional probability, or in Bayesian terms the likelihood of occurrence X, = 1, 
was defined in the section on Bernoulli trials (Equation 4.2) as 
pase i.) = Coal ag le (eqn 4.2) 


We can also shown from the definition of conditional probability that: 


Pr(Q = 6, N X, = i) = Pr(O = 6.) Pr(X, = i] 8). 


Substituting these into our definition of conditional probability, we have our 
original expression: 


Pr(O = 0.) gS = 0.) 
PO =k. =) = (eqn 4.3) 
oe 00) = reeds — fil Gh) 


After substituting in our likelihood function, Equation 4.2 we have the general form of 


the Bavesian conditional @ posteriori probability function used throughout this thesis: 


es) = Che ee" 
Pr) = 0-|X 1) ee (eqn 4.4) 
y, Pro = 6.) (8, (1-0)! 


This equation and its continuous case analogue, update the a priori probability 


function into an a posteriori probabilitv function that 1s conditioned on the proportion 


of actions that are peaceful by the antagonist. Equation 4.4 1s the conditional 
probability that the antagonists proportion of peaceful actions is 0, in the present 
conflict situation given that he has just recently been observed acting aggressively (for 
X, = 0). This is the result of the @ priori probability of the proportion of peaceful 
actions times the conditional probability that the antagonist acted aggressively in the 
past, given that his proportion of peaceful actions 1s ah divided by the sum of all the 
fyaris Iese events can Occur. 

The random Variable X,, will always be considered as a discrete random variable, 
but the a priori probability distributions may be discrete, Pr(QO = 0.). Or continuous, 


f (8). For the case where it is continuous, Baves Theorem is: 


f(@) (8)! (1 - yt 
f(@;X, =i) = —————___. (eqn 4.5) 
fence) (a) aera) 0 


The role of Baves Theorem 1s to establish a connection between the protagonist's 
a priori probability estimates, an expression of his uncertainty that the antagonist will 
act peacefully, Pr(Q = 9), and the effect that additional information or evidence will 
be to revise these probabilities. If the protagonist were to obtain perfect information, 
one of the probabilities would go to + 1.0 so that the Pr (O = 0) = + 1.0. This 
would be the one corresponding to the true distribution of @. The remaining 
probabilities would go to zero. Partial or less than perfect information, like that 
obtained through the model proposed here, produce a less marked effect on the a 
posteriori probabilities through the use of Baves Theorem. 

A result that follows from Baves Theorem is that it is inadvisable to attach 
probabilities of zero to uncertain events. Uncertain events of zero probability remain 
SO, nO matter what additional information 1s provided. Therefore, the decision maker 
must be aware of the possibility that he may be mustaken, for if he thinks that 
something cannot be true (e.g., peaceful intentions on the part of his antagonist) and 
interprets this as having zero probability, his decisions will never be influenced by any 
contradictory information which 1s surely absurd. This result is known in the study of 
Bavesian techniques as Cromwell's Rule. [{Ref. 8: p. 104] This same effect holds true. 
when a probability of 1.0 1s assigned, since the assignment of a probability of 1.0 to an 


uncertain event implies the probability of its complement is zero. 


tJ 
~~ 


Assuming probabilistic independence of the successive observations, the a 
posteriori probability after any set of observations becomes the a priori estimate for the 
next observation(s). It can also be shown that for any number of observations, the 


condiuonal likelihood function in Bayes Theorem is 
Pr(X,, =i 8,)=(8.) OG) ae Q.) (n=) Xp) (eqn 4.6) 


where 2 1s the total number of behaviors occurring, and we is the summation of the 
random variable NX, = i fori = 0 or 1.0. [Ref. 10] In this thesis we will deal with the 
behaviors one at a time, with specific examples worked out in the Chapter VII. 

In this chapter we introduced the concept of the antagonist’s exhibition of 
peaceful or not peaceful behavior as the realization of a Bernoulli random variable 
conditioned on the proportion of antagonist actions that are peaceful. This Bernoulli 
trial, combined with an expert’s conjectured a@ priori probability distribution, are used 
in Bayes Theorem to update this probability distribution into a current estimate of the 
antagonists intentions. In the next chapters we will take these @ posteriori probability 
distributions for the discrete and the continuous cases and see how they can be used to 


create decision rules based on maximuzing expected utility. 


V. DISCRETE CASE DECISION RULES WITH BAYES THEOREM 


In this chapter we will consider the discrete case for the @ priori probability 
distribution of the proportion of antagonist actions that are peaceful, 8. We will begin 
by showing a general decision rule for the decision problem represented by the conflict 
situation matrix. This will be followed by a discussion of how to use critical values and 
most likely values as a means of selecting 8. This chapter concludes with a section on 
using maximum expected utility values considering uncertain values of 8. Chapter VI 


will consider the continuous cases as the a priori probability distributions. 
A. A GENERALIZED DECISION RULE 
The decision rule from Chapter III for maximizing expected utility value, with 0 
certain, 1s to choose Re if 
aa DO V0) -{ 1/1 = 9), (eqn 3.2) 
and Py if 


a < (0; 1-6)-(1/ 1-98). (eqn 3.3) 


Returning to our original definition of expected value for the two alternatives, (P, or 


fan) in the conflict situation matrix we define 
0. + a(1- i) = E Ga208), 
and 
bO. - {1 - 0.) = E Ces) a 


mere. £ ec) is the expected utility value for alternative P., given any particular 


maime Of §. Also. the E Cee), is the expected uulity value for alternative P,. given 


anv particular value of 8. We can now generalize our decision rule to choose the 


alternative such that it is 
Max; E(P).0,) (eqn 5.1) 


Where j = c or h for any value of @.. 


B. CRITICAL AND MOST LIKELY VALUES 

The question that still remains, given that the decision maker has selected a 
preference pattern (and utility values for a and b), is what value to use for ae Tae 
protagonist might select a critical value that any estimates of @. should equal or exceed, 
and if he had the option, not consider a value of 8, until its probability of occurrence 
equalled or exceeded this stated critical value. Suppose the decision maker selected .95 
as his critical value, so that he would not select an alternative until any particular 
probability estiniate equalled or exceeded this value.! After the Pr(QO = 8.) equals or 
exceeds .95 he could substitute that value for @, into Equations 3.2 and 3.3, and based 
on whichever equation holds, choose the corresponding alternative. This changing of 
probabilities is best demonstrated with an example. 

Suppose, that the protagonist feels that the antagonist acts peacefully a small 
proportion of times and that he acts aggressively a large proportion of times, and that 


he suggests this @ priori probability distribution, such that 


Py (O 


9110) = 1/4 


and 


I 


Pr (O Poy =12 a) 

Now, suppose that the protagonist observes peaceful behavior on the part of the 
antagonist, or the value of X, = 1. Recalling Bayes Theorem (Equation 4.4) from 
Chapter bv: 


Pr(@ = 6.)(8,)'! (1-6)! 
| = 1 | -1 
y, PHO= 8 (05) (1-85) 
'¥t is conceded here that a third alternative, that of doing nothing is not included 


in the conflict situation matrix. The critical value discussion 1s predicated on the 
existence of this third alternative. 


and using the situation of our example we find 


(1 fea, 10) 
Pac = 9° RON 1) = ———— = (18 | 33) = 25S, 
Os) (oealOy (6 4) (14) 
and 
(3. 4)(1 4) 
PriO= | 1 1) = ———_—_— = (15 | 33) = 
ee) eel eo 10) 


As can be seen, none of the values of the @ posteriori probability distributions exceed 
our decision makers stated critical value of .95. So let's say he observes a second 
behavior, and this time he observes aggressive behavior by the antagonist, or X,, = 0. 
We again return to Baves Theorem to update our probability distribution, remembering 


that the @ priori distribution now was the a posteriori distribution above. We now have 


(18/33) (1 / 10) 


Pr(O@= 9 10| X,= 0) = 
(18/33) (1 / 10) + (15/33) (3 | 4) 


sh, 


and 
Gls so) (2.4) 


Gebes (ay.4) 418/33) (1) 10) 


The protagonist might continue taking observations of the antagonist’s behavior until 
one of the probabilities does actuallv exceed his stated critical value. He could then use 
this value of 0. in Equations 3.2 and 3.3 to choose his alternative. What if, after one 
Or more observations, the protagonist finds that none of the proposed probability 
distributions show estimates that exceed the critical value (as in the example above) and 
a decision must be made mow as to which course of action to pursue? This is the case 
when the conditional @ posteriori probabilities have not achieved a sufficient probability 
to exceed any stated critical value. The protagonist might then use the mos: likely 
value, Or the maximum Pr(@ = @,| X, = 1) and use this value of 0, in Equations 3.2 


and 3.3, to select the alternative with the maximum expected utility. Which, after the 
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first observation would mean that he would have used @ = .90, since its probability 
was the maximum, whereas, after the second observation he would have used the value 
of 8 = .25 as its probability of occurrence was the maximum in that instance. The 
decision maker could also use this criterion to select a value for Q. before the 


observation of any antagonist actions. In that case he would have selected 8 = .25. 


C. MANIMIZE EXPECTED UTILITY VALUE CONSIDERING ALL 
PROPOSED PARAMETER VALUES 


The procedures just described for selecting @ have a weakness since they do not 
use all the information available in the problem. If the protagonist is interested in 
maximizing his expected utility value relevant to all of his estimates of an antagonist s 
behavior. then he would not want to lose any information available to him. This would 
be the case in selecting the most likely and critical value criteria. Therefore, he would 
want to consider all possible parameter values proposed and the relative frequency with 
which thev can occur, and use the expected value of return.” 

[In the discrete case, the protagonist computes the expected value of return over 


0. if he uses course of action P., and he computes the expected value of return over 


C’ 
these same proposed values of Ge if he uses course of action Py. The course of action 
he uses corresponds to whichever of these two values 1s the maximum. This maximum 


value 1s 
Max; Wee (P;.8,) Pr(O = 6.| X, =i) (eqn 3.2) 


for j = c or h andr = 1,2... This is the Bayesian decision rule relative tomtom 
posteriori probability estimates. (Ref. 12] 

Returning to our discrete case example above, we have after two observations 
and assuming that the protagonist had selected the eighth listed preference pattern 


from Chapter ITT, 


V( Paar) > Me.) > Vira > Vee 
Or 


+ 1. > DW. 2 


“The term expected value of return is not to be interpreted as the value that will 
be obtained by the protagonist on any single occurrence but rather as the expected 
value of return due to the uncertainty of the value of @. 


BZ 


Tine Oana a — = 5-0. Im accordance with 5.2, we seek the maximum of 
peed) Pi On 8) | xX. = 1), 
or 


pe gece rere Cen = i) 


for both values of 8. Thus we have 
(-3.0)(1/10)(.138) + (-3)(3/4)(.862) = -1.98, 
Or 
(0 - 1.0)(.138) + (0 - 1.0)(.862) = -1.0. 


The protagonist would choose alternative Py. 

This procedure of selecting alternatives P. or Py is based on preference pattern 
selection, utility value assignment, Bayes Theorem and the use of the decision rule, 5.2. 
This decision rule can be used for any number of proposed values for the parameter he 
thus allowing the use of all the information available in the problem. In the next 
chapter we will look at the Bavesian decision rules that result from three different 


continuous case a priori probability distributions. 


VI. CONTINUOUS CASE DECISION RULES WITH BAYES THEOREM 


Rather than just considering the @ priori probability distribution to be finite, we 
would like to consider the case where there are an infinite number of values for 0. 
possible. In this case, the general Bayesian decision rule for the continuous case a 


priori probability function 1s to choose P; such that 
Max; {E (P;.) (Oeste (eqn 6.1) 


for j = cor h. There are three particular cases of continuous probability functions 
considered in this chapter, the first case being that all values in the range of 8 are 


equally hkelv. 


A. DECISION RULES USING A UNIFORM A PRIORI DISTRIBUTION 

The uniform case presents a good starting point for the protagonist if he has no 
information (or no idea) as to what proportion of actions, by the antagonist, are 
peaceful. The assumption that all possible values of @ are equally likely suggests an 
unbiased viewpoint, and when combined with Bayes Theorem, allows the distribution 
to weight future probabilities on observed behaviors alone. This can be seen by the 
form of the a posteriori probability distributions plotted in Figure 6.1. In this section 
we assume the protagonist uses a uniform a priori probability distribution, then we 
show the two a posteriori probability distributions that result after an observation of 
each tvpe of behavior. From these two updated probability distributions a generalized 
decision rule, based on the conflict situation matrix, 1s derived. 

The protagonist assumes that all values in the range of @ are equally likely and 
uses a uniform distribution as his a priori distribution. The general form of the uniform 


distribution 1s 
f£(8) = li(d-c), cSO<d. 
The limits for 8 are 0 and 1.0, so, the general form becomes 


f(8) = I(t - 0), 


= a 


Or 
F(®) = 1.0. 


The initial input into Bayes Theorem is this conjectured a priori probability 
distribution. The protagonist, next, observes the anatagonist’s behavior. There are two 
Becsibilities for this random variable, X) > 0 or + 1.0. Ii he observes aggressive 


behavior then X,, = 0 and the a posveriori distribution becomes 
NG) | ae 2 i (ean 6.2) 


This result can be seen by beginning with the general form of Baves theorem stated in 


Chapter IV as 
£(8) (8)! (1 - @y! 
(|X, =) = ———____, (eqn 4.5) 
fo £ (8) (8) ca - 8)! de 


where for the uniform case with i = 0 becomes 


(1.0)(1 - 6)! 
f(8| X, = 0) = ————————__ 
fg (1.0)(1 - 6)! dé 


Which simplifies to 


(CMP s@re = Oem 22 — 20°, (eqn 6.2) 


If the protagonist observes peaceful behavior on the part of the antagonist, then 


the random variable, X, = + 1.0. and the a posterior distribution becomes 


fae — Oe (200 (eqn 6.3) 


ore) 
(a 


a 


This result can be seen by beginning with the general form of Baves theorem which 


with 1 = + 1.0 becomes | 


(1.0)(8) it 
PO) Xo) oe | 
fg (1.0)(8) dé | 
which reduces to 
CG le eae la ee Ure (eqn 6.3) 


The effect that Bayes Theorem has on the a priori distribution can be seen in Figure 
6.1, where all three (the @ priori and the two resulting @ posteriori) distributions are 
plotted. 

Both of these @ posteriori probability distributions are legitimate probability 


distributions since | 


and 


1. Uniform Case Decision Rule for X, = 0 
The a@ = posteriori distribution that results after X,=0 is 
eh! y= 0) Se 20 (Equation 6.3). The decision rule is touichneer. P; that 
maximizes fg je (P;,8) f (@) d@, in accordance with 6.1, and we have the integral below 
to evaluate. The decision is to choose alternative P,., if the quantity on the leit exceeds 


the quantity on the right: 
fg (O-a® + a)(2- 20) > fg (@+bO - 1)(2 - 26) dO. 


After integration this decision rule, in the case for a uniform a priori and the 


observation of Seg becomes Select sik 


a> bee oe (eqn 6.4) 





aN) 


! f(B)1.2 
| 
| 
| 
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Figure 6.1 Uniform Case - A Prior: and A Posterior: Distributions. 


Or ae ve 
Ao ee (eqn o.)) 


This result can be seen after evaluation of the two integrals on either side of the 


Mreeguanty. The integral on the left is 
iv iO) .A0 eau = 20) du 
which equals 


(- 207 + 220° - da®@ + 20 + 2a) dO, 


oY 


which reduces to 

225 ee y oe 
The integral on the right is 

fg (8 + bO- 1)(2- 28) dO, 
which equals 
(48 + 2b@ - 2 - 207 - 2b07) dO , 

which reduces to 

b, 32'S. 


Use of these two values from our integrals brings us to our decision rules of choosing 
Po ila = bo = 3'2 07 Pp ou a <b ee 


2. Uniform Case Decision Rule with a = 1.0 


The a@ posteriori distribution that results after <. = Tage 


G4 ae = 1.0) = 20 (Equation 6.2). Following from the decision rule, 6.1, we have 
the integrals below to evaluate. Notice that only the a posteriori density function has 


changed. The decision maker would choose P, if 
Jg (8 - a8 + a) (28) d® > fy (8 + bO - 1) (28) dO, 
Which reduces to choosing P., if 


a > Deer (eqn 6.6) 





ONG ie if 
22 Dee 3 (eqn 6.7) 


This result can be seen bv comparing the value of the integral on the left to the value 


of the integral on the right. The integral on the left becomes 
2fg (8)? - (8)? + a(8) dO, 


which equals 


The integral on the right becomes 


24g (8)” + (8)? - (8) dO, 
which equals 
PD) eakbss 


These two values reduce to the decision rules of Equations 6.6 and 6.7 to choose 


alternative ae Mea =] 2a cr PE Ne << Pare 


B. DECISION RULES USING A BETA (a@ = 2, B = 2) A_ PRIORI 
DISTRIBUTION 


Another conceivable distribution that the decision maker might consider as best 
describing the antagonist’s behavior is a symmetric dome-shaped curve. This curve 
would have the mean equal to the mode, and be centered at 8 = .50. Such a curve 
would mean that the protagonist feels that most of the probability for @ is at the value 
.50. This would mean he feels it is most likely that half the actions of the antagonist 
are aggressive and half of them are peaceful, sloping off towards zero probability at the 
exireme values of 8. This kind of subjective feel for the a priori probability distribution 
can be captured by a Normal distribution, such that it is N ~ (.5, 67). However, the 
range of the Normal distribution is - © to + © and the range of the parameter. 9, is 
Oto + 1.0. A Beta distribution with parameters @=$>1 captures the essence of a 


svmmetric dome-shaped curve centered at .50, and it limits itself to the range of our 


Bie, 


random variable. The probability density function of a Beta distribution has the form: 


a) 
f(8) = = ——— _ (6)®-1(1-9)B-1 
P(a)P(B) 
For example, when a = f = 2.0, 
r(4) 
oe —— 
r(2)F (2) 


or 
F(9) = 6(9)(1 - 9). 
When this conjectured a priori probability distribution is used in Bayes theorem we get 


6(8)(1 - 8) (8)! (1 - 6)! 
f(@ IX, =i) = 


9 


6{q (8) (1 - 9) (8)! (1 - 6)! d6 
and when the observation of the antagonist’s behavior yields X,, = 0, this equals 
6(8)(1 - 8). 
6fg (8X1 - 8)- dé | 
The denominator is (6)(Beta(2,3)) density function which equals 6(1/12). The entire 
expression for the a posteriori probability distribution, with a conjectured a priori 


distribution of a Beta (2,2), and a subsequent observation of A, = 0, reducesite 


f(8| X, = 0) = 12(8\1- 9)°. (eqn 6.8) 
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If the subsequent observation of antagonist’s behavior 1s peaceful, or — = 1.0, 


then our formula from Bayes theorem becomes 


6(8)°(1 - 8) 
f(@|X, = 1.0) = —————___, 
6[g (8)°(1 - @) dé 


where the denominator is (6)(Beta(3,2)) densitv function which also equals 6(1/12), 


and we have 


f(8 |X, = 1.0) = 12(8)°(1 - 8). (eqn 6.9) 





icuneror2) Beta 2,2) Case - A Prion and A Posterior Distributions. 


4] 


The effect that Bayes Theorem has on the a priori distribution can be seen in Figure 
6.2 where all three (one a priori and two a posteriori) distributions for the Beta(2,2 


case are plotted. 


1. Beta (2,2) Case Decision Rule with Mae OO 


The decision. after observation of we = 0, 1s to choose alternative Pe if 
fg (0-28 + a)12(0)(1-8)- dO > fg (8+ bO-1)12(8)(1-8)° dO | 
and the decision rule reduces to selecting P. if 
do 2 ee oe (eqn 6.10) 
or Py if 
a =< Zhe 38 (eqn 6.11) 


This result can be seen by comparing the value of the integral on the left to the one on 


the right, where the integral on the left is 

12(1-a)fg (8)?(1-0)? d@ + 12afg (8)(1-6)? dé , 
and the two terms reduce ve 

12(1-a){P(3)0(3),P(6)} + laf P(2)P3)/T(5)}, 


which equals 
Safa 2 


The integral on the right becomes 


12(1 +b) fg (8)°(1-8)° dO - 12g (6)(1-8)° 48, 


yielding 
ieee) evans) RG = 12, F(2)0 (3) 8 (5)}, 


which equals 


and the two values reduce to the decision rules of Equations 6.10 and 6.11 to choose 
Peete ee Do ys Or Py il a = 2bi5"- 5) 3. 


2. Beta (2,2) Case Decision Rule with X, = 1.0 


The decision after observation of X, = 1.0 is to choose alternative P. if 
fg (O(1-a) + a)12(8)7(1-8) dB > fg (8(1 +b) - 1)12(6)7(1-6) 46 , 
and the decision rule reduces to selecting P., if 
ap D2 = 5.2, (eqn 6.12) 
ie, if 
aD 2215) 2) (eqn 6.13) 


This result can be seen by comparing the value of the integral on the left to the one on 


the right, where the integral on the left becomes 
12(1-a)fg (8)°(1-8) d@ + 12afg (8)*(1-6) dé, 
which reduces to 


12(1-a)[{(4)P(2)/1(6)] + 12a( (3) P(2)/P(S)], 
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which equals 
29/5 + 3/5. 
The integral on the right becomes 
12(1+b)fg (8)°(1-8) dO - 12fg (8)7( 1-6) dé, 
giving 
121+ BLE (4)E(2),0(6)} - 120 (312), 15, 


which equals 


These two values reduce to the decision rules, for the Beta(2,2) case and i = |, of 


Equations 6.12 and 6.13 to choose P. ifa > 3b/2- 5/2 or Py ifa < 3b/2 - 5,2. 


C. DECISION RULES USING A BETA (a@ = 1/2, B = 1/2) A PRIORI 
DISTRIBUTION 


So far, in the continuous cases, an equally likely a priori probability distribution 
(uniform) and a symmetric dome-shaped a priori probability distribution centered at 9 
= .50 (Beta 2,2) have been considered. What if the protagonist feels that the 
antagonist 1S an eXtremist type of thinker, in the sense that he has just as high a 
probability of acting peacefully as he does of acting aggressively and a low probability 
of an even split between the two? This would be the case if most of the probability 
Were weighted at the values of 8 = 0 and @ = + 1.0 witha very low probability of @ 
= .5. The essence of this tvpe of thinking could be captured with another @Ber 
distribution with @ = B < 1. This is a U-shaped probability density function centered 


again at @ = .50, but with most of the probability weighted at the extreme values of 0. 
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_ ee 


Here we will use a Beta function with a = $f = 1/2. The probability density function 


Giareectay |) 2.) 2) distribution has the form: 
T(1) 
f(8) = ————— 6)! * (1-6) "'*, 
rd 2ra®) 

Or 

£(0) = (limy@y! 21 - yh, 
monere 1 (1) (1;2)F (1 2) = 1/(./%-./T). 
When this conjectured a priori probability distribution is used in Baves theorem we get: 

(1x y(@)h 2 (1-6)? (6)! (1-6)! 


fg (1 m0)? (1-6)! (6)! (1-8)! a6 


If the observation of X, = 90, then, after cancelling the common factor of I'm, the a 


posteriori distribution becomes 


(@y! ? (1-6)! 
f(8|X, =0)= ———___—_, 
fg (0) 1 2 (1-0)!'2 ge 


where the denominator is 


which equals 


The a posteriori distribution for X,, = 0 becomes 


f(8| X, = 0) 


10 : 2.0 
Nie eo 2y_gyl7e | 
2a 2 
o| | =a) “Sf its 


Tay 72 (gy i 





Figure 6.5 Beta (1,2... 1,2) Case - A Priori and A Posterior Distributions: 


In the instance where the observation yields X,, = 1.0, Bayes formula gives 
(ny@yh = (1-ey ts? 
Ve uasiGhe= aah = aly 


and after cancelling the common factor of | 7, the a posteriori distribution becomes 


tv 


f(8| X. = 1.0) = ————————————__, 
pues gy l2 Q 
(5 (0). sale O) gama 
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(2'n)(Oy)/2 (y-ey!/2 | (eqn 6.14) 


_— ——- 


where the denominator 1s 


Tiseemrel 2) 
P(2) | 
or 
OM 
The a posteriori distribution for X,, = 1 becomes 


ACH tein Ee C179)(:) hee OCC) eae (eqn 6.15) 


The effect that Bayes Theorem has on the a priori distribution can be seen in Figure 


6.3 where all three (one a priori and two a posteriori) distributions are plotted. 


1. Beta (1/2.1/2) Case Decision Rule with X, = 0 
The decision after the observation of a = 01s to choose alternative ie if 


(2, m)fq{@(1-a) + a} ((8)!'2(1-6)! 2) dB > (2,m)fg(O(1 + b)-1} (0) !2(1-6)! 2 a9, 
which simplifies to choosing P. if 
aoe, 3 =a, (eqn 6.16) 
or Py if 
Aw DAS e-4 3 | (ea neo i?) 
This result can be seen by noting that the integral on the left 1s 


(1-a) wfg @'2(1-6)! “40 + 2anfy O°!'? (1-6)!/2 ae, 


47 


vielding 


(2(1-a)far] [[(3/2) (3/2) 8 (3) + [2a Oy 2 eee 
Or 
3a.4 = i: 


The integral on the right in our original expression is 


2(1 +b). mfg (8)! 21-8)! 2 d@ - (2/m)fg (8) 2(1-6)!/ a0, 


[2(1 + b) i'm] (F(3:2)1(3, 2), T(3)] - [2,0] (PF 1/2)0 (3/2) (2), 


OF 


b4 - 3,4. 


These two values reduce to the decision rules 


of Equations 6.16 and 6.17 to choose 
alternative P. if a > bid - 45 or Py il a < by3 = 4)3: 


2. Beta (1/2,1/2) Decision Rule with X, 


1.0 


The decision after observation of X, 1.0 1s to choose alternative ee if 


(2. fg £8(1-a)~ a} {(0)!/2¢1-8)"! 7) de > (2 mfg (01 +b)-1) (8)! 21-07! 4 a8 | 


which simplifies to choosing P, if 


(eqn 6.18) 


4$ 





Or in i 
A Wye (eqn 6.19) 


This result can be seen by comparing the value of the integral on the left to the one on 


the right. where the integral on the left is 
(2(1-a), m)fp (8)? (1-6) 1"? d8 + (2a:2)fq (0)!'2 (1-0)! a@, 
which evaluates to 
elev) (5, 2) 1/2), 1i3)}) arn) (1 (3/2)F 1/27 (2)}, 
vielding 
ara 14. 
The integral on the right in our original expression above is 
(2(1+b)/) fg 07'7(1-8)!'2 46 - (2/n) fg 0!/7(1-6)! 2 a0 
which reduces to 
Ole oy is 202) 39) | (2/r] (1 (3/2) 1/2);T(2)], 
vielding 
3b 4 - 1/4. 
These two values result in the decision rules of Equations 6.18 and 6.19 to choose P,, if 
pee - 2 or P, ifa < 3b- 2. 
Depending on his subjective feel for the antagonists intentions, or any 


historical data that provide a reasonable fit, the decision maker could choose of one of 


the three continuous a priori probability distributions presented in this chapter. The a 
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priori distribution selected and the subsequent behavior observed will determine which 
decision rule he will follow. A summary of the results of this chapter are presented in 
Appendix A. In Chapter VII we will show some specific examples for the cases 


presented here and in Chapter V. Chapter VIII will contain concluding remarks. 





VII. SPECIFIC EXAMIPLES 


In this chapter we will look at specific examples of the conjectured a priori 
probability distributions discussed in Chapters V and VI. There will be four cases for 
the a priori probability distributions (one discrete and three continuous) with eight 
examples. The discrete case will use four proposed values of 0. We will see how the 
behavior observed, through the use of Bayes Theorem, effects the alternative which 
maximizes the expected utility value. The three continuous case examples show how 
an observation of each tvpe of behavior effects the decision rule used and ultimately 
the alternative (P. or P:,) chosen. The examples used in this chapter will assume the 


following preference ordering of: 


V(PoAg) > V(PpAc) > V(PpAp) > V(PoAn) 


or 

oO Se tO. > Sale 
We also assume in this chapter that the utility values chosen for a = - 2.0 and for 
b= - 0.5. 


A. DISCRETE CASE WITH AGGRESSIVE BEHAVIOR OBSERVED 


We consider the instance of the following a priori probability distribution: 


Pr(®@ = 9'10) = (18) = .125 
Pr(®@ = 34) = (1/4) = 25 
Pr(@ = 14) = (14) = .25 
Pr(®@ = 1:10) = (3,8) = .375. 


Now, we suppose that the protagonist observes aggressive behavior (eee 0) on the 
part of the antagonist. Baves Theorem gives us for each of the possible values of 0. 
tinet 


Pr(@= 9/10 |X, =0) 


3] 


equals 


C1 2O)(1/8)} ¢ (CL. 10) 1/8) + (1 41/4) + (3,4)(14) + (9, 103, 8)} = (148) = .02. 


Simian 
(1'4)(1'4) 
PriQ= 34 pares) = ———— = (5/48) = .10, 
(48, 80) 
and 
(3/4)( 1/4) 
PriiQO= 14 Be = = (15/48) = 231, 
(48, $0) 
and 


(9: 10)(3/8) 
Pr(@= 1 10 |X,=0) = ———— = (27/48) = .56. 
(48,80) 


The course of action to choose in accordance with 5.2 is the maximum of 


II 
cD 


Yai - 8.) Pr(O Neuaaaia: 


r | n 


OT 


Y, (68, - 1) Pr(O 


ll 
&D 
~ 

Il 

~ 


OF the Maximum~or 


(-2.0) {(1/10)(1/48) + (1/4)(5:48) + (3/4)(15/48) + (9/10)(23/48)} = -1.53 





Or 
(-.5\(9 10)-1}(1 48) + ((-.5)(3/4)-11(5, 48) + ((-.5\(1'4)-1}(15/48) + £(-.5)(1/10)-1} (27 48) 


which equals - 1.12. The protagonist in this case would choose Pj, since -1.53 is less 


Godan =1.1 2. 


B. DISCRETE CASE WITH PEACEFUL BEHAVIOR OBSERVED 

This section uses the same conjectured probability distribution and utilitv values 
shown in the beginning of this chapter except with an observation of peaceful 
behavior, (X,, = 1.0). Bayes Theorem gives us for each of the possible values of 0, 


that 
Pr(O = 9:10 al) 
equals 


((1;8)(9'10)} J ((1)8)(910) + (1;4)(34) + (1/4)(1/4) + (3/8)(110)} = (9/32) = .28. 


Simuarly. 
(1,4)(3'4) 
Pr(O= 3:4 Esa 1.0) = —— = (15/32) = 47, 
(32 80) 
and 
(1 4) 4) 
PriO= 1/4 2 1.0) = —— = (5,32) = .16, 
(52 30) 
and 


(1 10)(1°8) 
Pr(O= 1.10 |X, = 1.0) = ———— = (3/32) = .09. 


We look for the maximum of 


(-2.0) (C1; 10)(9,;32) + (1/4)(13732) + (Q14)(3/32) - (9) 10i(G) 32 eee 


OF 





((-.5)(9) 10)-1}(9/32) + ((-.5)(3/4)- 1} (135,32) + ((-.5)(1;4)-1} (5732) + (-.5)C1/ 10)- 


Looe 


Given the above proposed a priori probability distribution and an observation of 


NX, = + 1.0, the protagonist would choose alternative P., since -./0 is greater than 
= 


3 
° Ol 


C. UNIFORM CASE WITH AGGRESSIVE BEHAVIOR OBSERVED 
In the umform case if the protagonist observes aggressive behavior then the 
decision rule (Equation 6.4) 1s to choose P. if 


a > (1/2)b - (3/2). 


Since a = - 2.0 and 6 = - .50, the the protagonist would choose P,. 


D. UNIFORM CASE WITH PEACEFUL BEHAVIOR OBSERVED 
The uniform case with an observation of peaceful behavior (X,, = 1.0) uses the 
decision rule derived in Chapter VI (Equation 6.6). This equation states to choose P,, if 


20277) 2 


The protagonist would choose P,, as his choice of action. 


E. BETA (2.2) CASE WITH AGGRESSIVE BEHAVIOR OBSERVED 
In the case of a Beta (2,2) as the proposed a priori probability distribution and 
an observation of aggressive behavior (X,, = 9) the decision rule (Equation 6.10) is to 


choose Ie it 


The protagonist would find that he is indifferent between alternatives P, and Py. 


F. BETA (2.2) CASE WITH PEACEFUL BEHAVIOR OBSERVED 
In the Beta (2,2) case with an observation of peaceful behavior the decision rule 


(Equation 6.12) is to choose P,, if 
aves ee 2D 15,2), 


In this case, substituting in the values for a and 6 will lead the protagonist to select P.. 


G. BETA (1/2,1/2) CASE WITH AGGRESSIVE BEHAVIOR OBSERVED 
A Beta (1 2,1 2) with X, = 0 as the observation yields Equation 6.16 as the 


decision rule. This decision rule states to choose ie if 
a 5155) b - (463). 


leading the protagonist to select Py, as his course of action. 


fee BETA (1/2,1/2) CASE WITH PEACEFUL BEHAVIOR OBSERVED 
Using a Beta (1.2,1/2) as the proposed a priori probability distribution, with an 
observation of X,, = 1.0, gives us the decision rule of Equation 6.18. The values of a 


and o, and the rule, 


cat 
Ca 


lead the protagonist to select alternative P.. 


The reader is reminded that while these specific examples appear to be definitive 
in their conclusions as to which course of action a protagonist should select, they are 
based on subjective assertions or historic data as to which probability distribution best 
describes the proportion of actions, by the antagonist, that are peaceful. These 


decision rules are only as good as the distributions they are based on. 








Can 
ON 


VIH. SUMMARY AND SUGGESTIONS FOR FURTHER STUDY 


This thesis has proposed a simple 2 x 2 decision under risk matrix as a model for 
@ecsien Making mM ail Unceriuin environment. This matrix is referred to as a conflict 
situation matrix. The 2 x 2 matrix design uses the two columns as two unknown states 
of nature and the two rows as the decision alternatives for a protagonist. The random 
MiemeswOl Nature are mterpreted as being either the Occurrence or non-occurrence of 
peaceful behavior on the part of an antagonist. The probability of occurrence of either 
state of nature is estimated through the use of Bayes Theorem. 

Lulity values. rated on an interval scale. are assigned to the four cells of the 
matrix. These utility values are used as payoffs in the conflict situation matnx. The 
estimated probabilities of occurrence of peaceful behavior of the antagonist are used to 
develop a Bayesian decision rule to maximize expected utility value for the protagonist. 
The Bayesian decision rules used are relative to the a posteriori probability distributions 
whereas the initial a priori probability distribution estimates are subjective or based on 
historic data. The Bayesian decision rules developed use discrete and continuous a 
priori probability distributions. The decision rule for the discrete case is to choose the 


alternative Fe or By that maximizes 


> 


, B(PsL) PO) = Gh See} 


where} = corhandr = 1,2.... This decision rule and its continuous case analogue 
Savs to select the alternative that maximizes expected utilitv relative to the @ posteriori 
Bayesian estimates. The three continuous case a priori probability distributions 
eecidered are the uniform, the Beta (2,2) and the Beta (1'2,1/2). The general form of 


the decision rule for these cases 1s to select the alternative Ie or ern that maximizes 
ee (P;.8) F(O|X, = i) de, 


fpamere | — ¢ or h and a is the continuous @ prior! probability distribution being 


considered. 
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The model proposed here provides reasonable decision rules that are nor counter- 
intuitive. The idea of categorizing all protagonist and antagonist behaviors (or actions) 
as either peaceful or aggressive when incorporated with the Bayesian decision rules 
work well, and make for a usable, easily understood model. This model and its 
decision rules may also provide a building block for larger models. An area that could 
provide room for expansion nught include a larger conflict situation matrix. If we 
allowed the protagonist and the antagonist additional alternatives, (e.g., no action 
taken. as suggested in in the fifth chapter). then we would open up the possibility for 
ann xX m_conflict situation matrix. 

Another additional feature that wasn’t considered here would be to let the 
decision maker rate all the outcomes in the matrix on an interval scale. This would 
surelv affect the decision rules with the addition of two more variables and would allow 
all combinations of preference patterns to be considered. 

Variance arguments for the a priori Beta probability functions, with @ equal to B, 
could also be incorporated. The mean of these density functions is .50 asp = aatf. 
The variance of a Beta distribution is given by o7 = aB.(a + 8)? (a + B +1). This 
fact could be used for the @ = PB > 1.0 case, that increasing the values of @ and f, 
while maintaining their equatitv, decreases the variance of the parameter and leaves the 
mean unchanged. This causes the distribution to weight more and more probability 
about the mean and less and less at the extremes. This characteristic of the Beta 
function cculd be exploited to reflect a degree of confidence by the decision makenum 
his a priori assumpuon of a Beta distribution with a = B > 1. Inthe a@ = B < 1.0 
case we would have. as @ = B > + 1.0, the 6” = 0, and this could also be used to 
reflect the decision makers confidence in his @ priori assumption of a Beta distribution 
with @ = B < 1.0. This particular characteristic of Beta distributions with a@ = 
could provide an area for further study. Other continuous a priori probability 
distributions, and particularly non-symmetric Beta distributions (a=), should also 
provide interesting further research. 

The purpose of this model is to analyze the probable future consequences of 
decision choices. The decision criteria used in the model proposed is highest expected 
uulity value. It is conceded that a simple model, such as this, and its suggested 
decision criteria cannot capture all the ramifications and consequences of political 
decisions that befall a decision maker at the national level. While 1t may be true that 


“models cannot substitute for reality and analysts cannot substitute for decision 
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makers”, {Ref. 13] it is hoped that this parsimonious, quantitative model will be able to 


provide an unbiased, objective input into the decision making process. 


APPENDIX A 
SUMMARY OF RESULTS FOR CONTINUOUS CASES 


RESULTS WITH 
UNIFORM A PRIORI 


A PRIORI PROBABILITY DISTRIB ie 
f(8) = 1 


A POSTERIORI for action X,, = i 





mee et | 
(aggressive) (peaceful) 


- 28 28 


bo 


Choose she if 


a> 1/2b-32 a> 2b-3 


Choose gn if 


a < ]/2b- 3/2 a < 2b- 3 


60 


RESULTS WITH 
BETA (2,2) A PRIORI 


A PRIORI PROBABILITY DISTRIBUTION 
Oe emer (2) lb 23) (0 CI=6) 


A POSTERIORI for action X,, = 


| amy Se aie 
(aggressive) (peaceful) 
12(6)(1-8)- 12(8)*(1-6) 


Choose ee if 


| a3 2b 3, 2 


Choose aN if 


6] 


RESULTS WITH 
BETA (1/2,1/2) A PRIORI 


A PRIORI PROBABILITY DISTRIBUTION 
6) = (T() / Fa.2) Fa, 2) y! 41-0)! 2 


A POSTERIORI for action X, = i 





Bd ape Dice aul 
(aggressive) (peaceful) 
| 1'274_qQ)-1/2 
? 7(0)! *(1-6)) 2 ait(O)°* (1-0) 


Choose ie Ve 


Choose ae if 


lsd 
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